I am working on a scraper that fetches the data, and then I parse the text of the comments looking for specific keywords. Because comments are really unstructured, information about the towns I lived in can show up in various formats, like "lived in Harrisburg", "lived in Harrisburg, NYC and Chatsworth", etc. Is there any library or package that would come in handy for this kind of processing of unstructured data? As of now I am coding the parsing logic myself.
Again, thanks in advance!
I don't know of any package that does this for you. Sorting through unstructured data is so highly variable that there's really no way for someone to build something in advance to suit your needs.
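If you do keep hand-rolling the parsing, a regular expression can cover the two example formats from the question. This is only a rough sketch using Python's standard `re` module. The names `PLACE`, `SEP`, `LIVED_IN` and `towns_lived_in` are made up for illustration, and it assumes town names are capitalized and separated by commas or "and".

```python
import re

# Assumption: a place name is one or more capitalized words ("Harrisburg", "New York", "NYC").
PLACE = r"[A-Z][\w'-]*(?:\s+[A-Z][\w'-]*)*"
# Assumption: places are separated by ",", ", and" or " and ".
SEP = r"\s*,\s*(?:and\s+)?|\s+and\s+"

# "lived in" followed by a list of places.
LIVED_IN = re.compile(rf"\b[Ll]ived in\s+({PLACE}(?:(?:{SEP}){PLACE})*)")
SEPARATOR = re.compile(SEP)


def towns_lived_in(comment):
    """Return every town mentioned after a 'lived in' phrase in the comment."""
    towns = []
    for match in LIVED_IN.finditer(comment):
        # Split the captured list ("Harrisburg, NYC and Chatsworth") into names.
        towns.extend(SEPARATOR.split(match.group(1)))
    return towns


print(towns_lived_in("I lived in Harrisburg, NYC and Chatsworth for years"))
# ['Harrisburg', 'NYC', 'Chatsworth']
```

It will still trip on plenty of real comments (lowercase town names, "lived in Harrisburg and I loved it", abbreviations like "St. Louis"), which is the variability problem in a nutshell, so expect to keep adjusting the pattern against your actual data.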
-Harrison 7 years ago